114 research outputs found
Recommended from our members
BioC: a minimalist approach to interoperability for biomedical text processing
A vast amount of scientific information is encoded in natural language text, and the quantity of such text has become so great that it is no longer economically feasible to have a human as the first step in the search process. Natural language processing and text mining tools have become essential to facilitate the search for and extraction of information from text. This has led to vigorous research efforts to create useful tools and to create humanly labeled text corpora, which can be used to improve such tools. To encourage combining these efforts into larger, more powerful and more capable systems, a common interchange format to represent, store and exchange the data in a simple manner between different language processing systems and text mining tools is highly desirable. Here we propose a simple extensible mark-up language format to share text documents and annotations. The proposed annotation approach allows a large number of different annotations to be represented including sentences, tokens, parts of speech, named entities such as genes or diseases and relationships between named entities. In addition, we provide simple code to hold this data, read it from and write it back to extensible mark-up language files and perform some sample processing. We also describe completed as well as ongoing work to apply the approach in several directions. Code and data are available at http://bioc.sourceforge.net/. Database URL: http://bioc.sourceforge.net
Text mining for the biocuration workflow
Molecular biology has become heavily dependent on biological knowledge encoded in expert curated biological databases. As the volume of biological literature increases, biocurators need help in keeping up with the literature; (semi-) automated aids for biocuration would seem to be an ideal application for natural language processing and text mining. However, to date, there have been few documented successes for improving biocuration throughput using text mining. Our initial investigations took place for the workshop on ‘Text Mining for the BioCuration Workflow’ at the third International Biocuration Conference (Berlin, 2009). We interviewed biocurators to obtain workflows from eight biological databases. This initial study revealed high-level commonalities, including (i) selection of documents for curation; (ii) indexing of documents with biologically relevant entities (e.g. genes); and (iii) detailed curation of specific relations (e.g. interactions); however, the detailed workflows also showed many variabilities. Following the workshop, we conducted a survey of biocurators. The survey identified biocurator priorities, including the handling of full text indexed with biological entities and support for the identification and prioritization of documents for curation. It also indicated that two-thirds of the biocuration teams had experimented with text mining and almost half were using text mining at that time. Analysis of our interviews and survey provide a set of requirements for the integration of text mining into the biocuration workflow. These can guide the identification of common needs across curated databases and encourage joint experimentation involving biocurators, text mining developers and the larger biomedical research community
Preferential regulation of miRNA targets by environmental chemicals in the human genome
<p>Abstract</p> <p>Background</p> <p>microRNAs (miRNAs) represent a class of small (typically 22 nucleotides in length) non-coding RNAs that can degrade their target mRNAs or block their translation. Recent disease research showed the exposure to some environmental chemicals (ECs) can regulate the expression patterns of miRNAs, which raises the intriguing question of how miRNAs and their targets cope with the exposure to ECs throughout the genome.</p> <p>Results</p> <p>In this study, we comprehensively analyzed the properties of genes regulated by ECs (EC-genes) and found miRNA targets were significantly enriched among the EC-genes. Compared with the non-miRNA-targets, miRNA targets were roughly twice as likely to be EC-genes. By investigating the collection methods and other properties of the EC-genes, we demonstrated that the enrichment of miRNA targets was not attributed to either the potential collection bias of EC-genes, the presence of paralogs, longer 3'UTRs or more conserved 3'UTRs. Finally, we identified 1,842 significant concurrent interactions between 407 miRNAs and 497 ECs. This association network of miRNAs-ECs was highly modular and could be separated into 14 interconnected modules. In each module, miRNAs and ECs were closely connected, providing a good method to design accurate miRNA markers for ECs in toxicology research.</p> <p>Conclusions</p> <p>Our analyses indicated that miRNAs and their targets played important roles in cellular responses to ECs. Association analyses of miRNAs and ECs will help to broaden the understanding of the pathogenesis of such chemical components.</p
BioCreative III interactive task: an overview
The BioCreative challenge evaluation is a community-wide effort for evaluating text mining and information extraction systems applied to the biological domain. The biocurator community, as an active user of biomedical literature, provides a diverse and engaged end user group for text mining tools. Earlier BioCreative challenges involved many text mining teams in developing basic capabilities relevant to biological curation, but they did not address the issues of system usage, insertion into the workflow and adoption by curators. Thus in BioCreative III (BC-III), the InterActive Task (IAT) was introduced to address the utility and usability of text mining tools for real-life biocuration tasks. To support the aims of the IAT in BC-III, involvement of both developers and end users was solicited, and the development of a user interface to address the tasks interactively was requested
Revisiting HIV-1 uncoating
HIV uncoating is defined as the loss of viral capsid that occurs within the cytoplasm of infected cells before entry of the viral genome into the nucleus. It is an obligatory step of HIV-1 early infection and accompanies the transition between reverse transcription complexes (RTCs), in which reverse transcription occurs, and pre-integration complexes (PICs), which are competent to integrate into the host genome. The study of the nature and timing of HIV-1 uncoating has been paved with difficulties, particularly as a result of the vulnerability of the capsid assembly to experimental manipulation. Nevertheless, recent studies of capsid structure, retroviral restriction and mechanisms of nuclear import, as well as the recent expansion of technical advances in genome-wide studies and cell imagery approaches, have substantially changed our understanding of HIV uncoating. Although early work suggested that uncoating occurs immediately following viral entry in the cell, thus attributing a trivial role for the capsid in infected cells, recent data suggest that uncoating occurs several hours later and that capsid has an all-important role in the cell that it infects: for transport towards the nucleus, reverse transcription and nuclear import. Knowing that uncoating occurs at a later stage suggests that the viral capsid interacts extensively with the cytoskeleton and other cytoplasmic components during its transport to the nucleus, which leads to a considerable reassessment of our efforts to identify potential therapeutic targets for HIV therapy. This review discusses our current understanding of HIV uncoating, the functional interplay between infectivity and timely uncoating, as well as exposing the appropriate methods to study uncoating and addressing the many questions that remain unanswered
Development of a quality indicator set to measure and improve quality of ICU care for patients with traumatic brain injury.
BACKGROUND: We aimed to develop a set of quality indicators for patients with traumatic brain injury (TBI) in intensive care units (ICUs) across Europe and to explore barriers and facilitators for implementation of these quality indicators. METHODS: A preliminary list of 66 quality indicators was developed, based on current guidelines, existing practice variation, and clinical expertise in TBI management at the ICU. Eight TBI experts of the Advisory Committee preselected the quality indicators during a first Delphi round. A larger Europe-wide expert panel was recruited for the next two Delphi rounds. Quality indicator definitions were evaluated on four criteria: validity (better performance on the indicator reflects better processes of care and leads to better patient outcome), feasibility (data are available or easy to obtain), discriminability (variability in clinical practice), and actionability (professionals can act based on the indicator). Experts scored indicators on a 5-point Likert scale delivered by an electronic survey tool. RESULTS: The expert panel consisted of 50 experts from 18 countries across Europe, mostly intensivists (N = 24, 48%) and neurosurgeons (N = 7, 14%). Experts agreed on a final set of 42 indicators to assess quality of ICU care: 17 structure indicators, 16 process indicators, and 9 outcome indicators. Experts are motivated to implement this finally proposed set (N = 49, 98%) and indicated routine measurement in registries (N = 41, 82%), benchmarking (N = 42, 84%), and quality improvement programs (N = 41, 82%) as future steps. Administrative burden was indicated as the most important barrier for implementation of the indicator set (N = 48, 98%). CONCLUSIONS: This Delphi consensus study gives insight in which quality indicators have the potential to improve quality of TBI care at European ICUs. The proposed quality indicator set is recommended to be used across Europe for registry purposes to gain insight in current ICU practices and outcomes of patients with TBI. This indicator set may become an important tool to support benchmarking and quality improvement programs for patients with TBI in the future
HIV-1 Protease and Reverse Transcriptase Control the Architecture of Their Nucleocapsid Partner
The HIV-1 nucleocapsid is formed during protease (PR)-directed viral maturation, and is transformed into pre-integration complexes following reverse transcription in the cytoplasm of the infected cell. Here, we report a detailed transmission electron microscopy analysis of the impact of HIV-1 PR and reverse transcriptase (RT) on nucleocapsid plasticity, using in vitro reconstitutions. After binding to nucleic acids, NCp15, a proteolytic intermediate of nucleocapsid protein (NC), was processed at its C-terminus by PR, yielding premature NC (NCp9) followed by mature NC (NCp7), through the consecutive removal of p6 and p1. This allowed NC co-aggregation with its single-stranded nucleic-acid substrate. Examination of these co-aggregates for the ability of RT to catalyse reverse transcription showed an effective synthesis of double-stranded DNA that, remarkably, escaped from the aggregates more efficiently with NCp7 than with NCp9. These data offer a compelling explanation for results from previous virological studies that focused on i) Gag processing leading to nucleocapsid condensation, and ii) the disappearance of NCp7 from the HIV-1 pre-integration complexes. We propose that HIV-1 PR and RT, by controlling the nucleocapsid architecture during the steps of condensation and dismantling, engage in a successive nucleoprotein-remodelling process that spatiotemporally coordinates the pre-integration steps of HIV-1. Finally we suggest that nucleoprotein remodelling mechanisms are common features developed by mobile genetic elements to ensure successful replication
- …